Parametric fractional imputation for mixed models with nonignorable missing data

نویسندگان

  • Shu Yang
  • Jae-Kwang Kim
  • Zhengyuan Zhu
چکیده

Inference in the presence of non-ignorable missing data is a widely encountered and difficult problem in statistics. Imputation is often used to facilitate parameter estimation, which allows one to use the complete sample estimators on the imputed data set. We develop a parametric fractional imputation (PFI) method proposed by Kim (2011), which simplifies the computation associated with the EM algorithm for maximum likelihood estimation with missing data. We first consider the problem of parameter estimation for linear mixed models with non-ignorable missing values, which assumes that missingness depends on the missing values only through the random effects, leading to shared parameter models (Follmann and Wu, 1995). In the M-step, the restricted or adjusted profiled maximum likelihood method is used to reduce the bias of maximum likelihood estimation of the variance components. Results from a limited simulation study are presented to compare the proposed method with the existing methods, which demonstrates that imputation can significantly reduce the nonresponse bias and the idea of adjusted profiled maximum likelihood works nicely in PFI for the bias correction in estimating the variance components. Variance estimation is also discussed. We next extend PFI to generalized linear mixed model and the flexibility of this method is illustrated by analyzing the infamous salamander mating data (McCullagh and Nelder, 1989).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A semi-parametric approach to fractional imputation for nonignorable missing data

Parameter estimation with nonignorable missing data is a challenging problem in statistics. Fully parametric approach for joint modeling of the response model and the population model can produce results that are very sensitive against the failure of the assumed model. We consider a more robust approach of modeling by describing the model for the nonresponding part as a exponential tilting of t...

متن کامل

Use of auxiliary data in semi-parametric spatial regression with nonignorable missing responses

We propose a method for reducing the error of the prediction of a quantity of interest when the outcome has missing values that are suspected to be nonignorable and the data are correlated in space. We develop a maximum likelihood approach for the parameter estimation of semi-parametric regressions in a mixed model framework. We apply the proposed method to phytoplankton data collected at fixed...

متن کامل

Likelihood-based Inference with Nonignorable Missing Responses and Covariates in Models for Discrete Longitudinal Data

We propose methods for estimating parameters in two types of models for discrete longitudinal data in the presence of nonignorable missing responses and covariates. We first present the generalized linear model with random effects, also known as the generalized linear mixed model. We specify a missing data mechanism and a missing covariate distribution and incorporate them into the complete dat...

متن کامل

Imputation for nonmonotone nonresponse in the survey of industrial research and development

Nonresponse in longitudinal studies often occurs in a nonmonotone pattern. In the Survey of I ndustrial Research and Development (SIRD), it is reasonable to assume that the nonresponse mechanism is past-value-dependent in the sense that the response propensity of a study variable at time point t depends on response status and observed or missing values of the same variable at time points prior ...

متن کامل

Dealing with missing data in the Center for Epidemiologic Studies Depression self-report scale: a study based on the French E3N cohort

BACKGROUND The Center for Epidemiologic Studies - Depression scale (CES-D) is a validated tool commonly used to screen depressive symptoms. As with any self-administered questionnaire, missing data are frequently observed and can strongly bias any inference. The objective of this study was to investigate the best approach for handling missing data in the CES-D scale. METHODS Among the 71,412 ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013